Reinforcement theory

Results: 290



#Item
241Point-based value iteration: An anytime algorithm for POMDPs Joelle Pineau, Geoff Gordon and Sebastian Thrun Carnegie Mellon University Robotics Institute 5000 Forbes Avenue Pittsburgh, PA 15213

Point-based value iteration: An anytime algorithm for POMDPs Joelle Pineau, Geoff Gordon and Sebastian Thrun Carnegie Mellon University Robotics Institute 5000 Forbes Avenue Pittsburgh, PA 15213

Add to Reading List

Source URL: www.cs.cmu.edu

Language: English - Date: 2003-06-04 12:29:32
242Agendas for Multi-Agent Learning Geoffrey J. Gordon December 2006 CMU-ML[removed]School of Computer Science Carnegie Mellon University

Agendas for Multi-Agent Learning Geoffrey J. Gordon December 2006 CMU-ML[removed]School of Computer Science Carnegie Mellon University

Add to Reading List

Source URL: www.cs.cmu.edu

Language: English - Date: 2006-12-20 14:23:06
243Multiagent Learning in the Presence of Agents with Limitations Michael Bowling May 14, 2003 CMU-CS[removed]

Multiagent Learning in the Presence of Agents with Limitations Michael Bowling May 14, 2003 CMU-CS[removed]

Add to Reading List

Source URL: reports-archive.adm.cs.cmu.edu

Language: English - Date: 2003-07-21 09:06:25
244Approximate Solutions For Partially Observable Stochastic Games with Common Payoffs Rosemary Emery-Montemerlo, Geoff Gordon, Jeff Schneider School of Computer Science Carnegie Mellon University Pittsburgh, PA 15213

Approximate Solutions For Partially Observable Stochastic Games with Common Payoffs Rosemary Emery-Montemerlo, Geoff Gordon, Jeff Schneider School of Computer Science Carnegie Mellon University Pittsburgh, PA 15213

Add to Reading List

Source URL: www.cs.cmu.edu

Language: English - Date: 2004-06-25 15:30:50
245Policy-contingent abstraction for robust robot control  Joelle Pineau, Geoff Gordon and Sebastian Thrun School of Computer Science Carnegie Mellon University Pittsburgh, PA 15213

Policy-contingent abstraction for robust robot control Joelle Pineau, Geoff Gordon and Sebastian Thrun School of Computer Science Carnegie Mellon University Pittsburgh, PA 15213

Add to Reading List

Source URL: www.cs.cmu.edu

Language: English - Date: 2003-06-04 12:29:33
246No-Regret Learning and a Mechanism for Distributed Multiagent Planning Jan-P. Calliess Geoffrey J. Gordon

No-Regret Learning and a Mechanism for Distributed Multiagent Planning Jan-P. Calliess Geoffrey J. Gordon

Add to Reading List

Source URL: www.cs.cmu.edu

Language: English - Date: 2008-02-18 10:33:25
247ICML 2012 Handbook  International Conference on Machine Learning June 26 - July 1, 2012 Edinburgh, Scotland, UK

ICML 2012 Handbook International Conference on Machine Learning June 26 - July 1, 2012 Edinburgh, Scotland, UK

Add to Reading List

Source URL: icml.cc

Language: English - Date: 2012-06-14 13:31:31
248Selecting Computations: Theory and Applications  Nicholas Hay and Stuart Russell Computer Science Division University of California Berkeley, CA 94720

Selecting Computations: Theory and Applications Nicholas Hay and Stuart Russell Computer Science Division University of California Berkeley, CA 94720

Add to Reading List

Source URL: www.cs.berkeley.edu

Language: English - Date: 2012-10-04 09:08:48
249State Abstraction for Programmable Reinforcement Learning Agents David Andre and Stuart J. Russell Computer Science Division, UC Berkeley, CA[removed]fdandre,[removed]

State Abstraction for Programmable Reinforcement Learning Agents David Andre and Stuart J. Russell Computer Science Division, UC Berkeley, CA[removed]fdandre,[removed]

Add to Reading List

Source URL: www.cs.berkeley.edu

Language: English - Date: 2008-01-03 13:48:15
250Q-Decomposition for Reinforcement Learning Agents  Stuart Russell @.. Andrew L. Zimdars @..

Q-Decomposition for Reinforcement Learning Agents Stuart Russell @.. Andrew L. Zimdars @..

Add to Reading List

Source URL: www.cs.berkeley.edu

Language: English - Date: 2003-06-03 00:44:40